Conversation
Force-pushed 98127fc to e551e65
Introduces a scheduler with four layered admission constraints:

- Total concurrency: hard global cap on running jobs
- Priority tiers: weighted share of total concurrency per tier (Priority struct with Level + Weight)
- Per-type limits: fraction of tier slots per job type
- Conflict groups: mutual exclusion by group + job ID

Within a tier, jobs are ordered by accumulated cost per fairness key (lowest first), with cost estimated via an EMA of wall time.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Force-pushed 14e8f8a to ccf88e4
Add per-job admission cost floor to prevent volume-based DoS by charging a minimum fairness cost regardless of actual job duration.

Deduplicate jobs by (type, id): Submit silently drops duplicates; RunSync coalesces callers onto the existing job. Sync and async jobs can be mixed on the same key. When all RunSync waiters cancel and no Submit owns the job, the job's context is cancelled.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
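The (type, id) deduplication described here might look like the sketch below. Names (`dedupTable`, `jobKey`, `submit`, `runSync`) are invented for illustration; only the behavior (Submit drops duplicates, RunSync coalesces waiters onto an existing job) comes from the commit message.

```go
package main

import (
	"fmt"
	"sync"
)

// jobKey identifies a job for deduplication; names are illustrative.
type jobKey struct {
	Type string
	ID   string
}

type job struct {
	key     jobKey
	waiters []chan error // RunSync callers coalesced onto this job
}

type dedupTable struct {
	mu     sync.Mutex
	active map[jobKey]*job
}

// submit registers a job unless one with the same key is already active;
// it reports whether the job was newly added (duplicates are dropped).
func (d *dedupTable) submit(k jobKey) bool {
	d.mu.Lock()
	defer d.mu.Unlock()
	if _, ok := d.active[k]; ok {
		return false // duplicate: silently dropped
	}
	d.active[k] = &job{key: k}
	return true
}

// runSync coalesces the caller onto an existing job (creating one if
// needed), returning the channel the caller would wait on.
func (d *dedupTable) runSync(k jobKey) <-chan error {
	d.mu.Lock()
	defer d.mu.Unlock()
	j, ok := d.active[k]
	if !ok {
		j = &job{key: k}
		d.active[k] = j
	}
	done := make(chan error, 1)
	j.waiters = append(j.waiters, done)
	return done
}

func main() {
	d := &dedupTable{active: make(map[jobKey]*job)}
	k := jobKey{Type: "reindex", ID: "repo-1"}
	fmt.Println(d.submit(k)) // true: first submission wins
	fmt.Println(d.submit(k)) // false: duplicate dropped
	d.runSync(k)             // sync caller coalesces onto the same job
	fmt.Println(len(d.active[k].waiters))
}
```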
Force-pushed ccf88e4 to 235e5b2
    // Scheduler implements weighted fair queuing with conflict exclusion.
    type Scheduler struct {
        mu         sync.Mutex
        priorities map[Priority]bool
If Priority includes both Level and Weight, does this break the invariant that same-level means same-tier? You could have multiple entries with the same Level (and different Weights) at once, as separate map keys.
    s.removeWaiterLocked(j, done)
    s.maybeRemoveJobLocked(j)
    s.mu.Unlock()
    return errors.WithStack(ctx.Err())
Suggested change:

    return errors.WithStack(ctx.Err())
    case <-s.ctx.Done():
        return errors.WithStack(s.ctx.Err())
    s.mu.Unlock()

    for _, j := range toRun {
        go s.executeJob(j)
amp review for your consideration:
Close() doesn't wait for executing jobs. s.wg only tracks dispatchLoop and cleanupLoop. Jobs launched via go s.executeJob(j) aren't tracked, so Close() returns while jobs are still running. This can cause data corruption or lost waiter signals during shutdown. Fix: s.wg.Add(1) before go s.executeJob(j), defer s.wg.Done() inside executeJob.
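The suggested fix could be sketched as below: count each job in the WaitGroup before its goroutine starts, so Close() observes it. The surrounding types and the `dispatch` helper are invented for illustration; only the Add/Done placement mirrors the review comment.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// scheduler is a pared-down stand-in for the real type; only the
// WaitGroup accounting pattern is the point of this sketch.
type scheduler struct {
	wg sync.WaitGroup
}

func (s *scheduler) dispatch(jobs []func()) {
	for _, fn := range jobs {
		s.wg.Add(1) // count the job before the goroutine starts
		go s.executeJob(fn)
	}
}

func (s *scheduler) executeJob(fn func()) {
	defer s.wg.Done() // always release the count, even on early return
	fn()
}

// Close blocks until all tracked goroutines, including jobs, finish.
func (s *scheduler) Close() {
	s.wg.Wait()
}

func main() {
	s := &scheduler{}
	done := false
	s.dispatch([]func(){func() {
		time.Sleep(10 * time.Millisecond)
		done = true
	}})
	s.Close()
	fmt.Println(done) // wg.Wait guarantees the job ran to completion
}
```

Calling `wg.Add(1)` in the dispatching goroutine (not inside `executeJob`) matters: if the Add happened inside the new goroutine, Close() could call Wait before the goroutine was scheduled and return early anyway.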
    j.waiters = nil
    s.recordMetricsLocked()
    s.mu.Unlock()
amp suggests defer rather than inlining this section
Panic in j.fn() permanently leaks a concurrency slot. If the job function panics, the cleanup code (removing from running, deleting from active, signaling waiters) never runs. The slot is permanently occupied and waiters deadlock. Should use defer for the cleanup path and consider recover().
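A panic-safe version of that cleanup path might look like this sketch: the slot release runs in a defer with a recover(), so a panicking job function cannot leak its concurrency slot. `slotPool` and its fields are illustrative names, not from the patch.

```go
package main

import (
	"fmt"
	"sync"
)

// slotPool is a minimal stand-in for the scheduler's slot accounting.
type slotPool struct {
	mu      sync.Mutex
	running int
}

func (p *slotPool) executeJob(fn func()) {
	p.mu.Lock()
	p.running++ // occupy a concurrency slot
	p.mu.Unlock()

	defer func() {
		// Cleanup runs whether fn returned normally or panicked.
		if r := recover(); r != nil {
			fmt.Println("job panicked:", r)
		}
		p.mu.Lock()
		p.running-- // release the slot so waiters can't deadlock
		p.mu.Unlock()
	}()

	fn()
}

func main() {
	p := &slotPool{}
	p.executeJob(func() { panic("boom") })
	fmt.Println("running after panic:", p.running)
}
```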
    jobsTotal   metric.Int64Counter
    jobDuration metric.Float64Histogram
it doesn't look like these are used anywhere?
Summary
Adds a weighted fair queuing scheduler with conflict exclusion. All work (foreground and background) flows through a single scheduler that controls admission via four layered constraints:
1. Total concurrency: hard global cap on running jobs (Config.TotalConcurrency)
2. Priority tiers: weighted share of total concurrency per tier (Priority.Weight / sum(weights) * total). Higher-level tiers dispatch first, preventing background from starving foreground
3. Per-type limits: fraction of tier slots per job type (JobTypeConfig.MaxConcurrency, 0-1)
4. Conflict groups: mutual exclusion by group + job ID

Within a tier, jobs are ordered by accumulated cost per fairness key (lowest first, then arrival time). Cost is estimated via an exponential moving average of observed wall time.
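The intra-tier ordering could be sketched as a simple two-key sort: lowest accumulated cost first, arrival order as the tiebreaker. `queuedJob` and its fields are illustrative names, not the PR's actual types.

```go
package main

import (
	"fmt"
	"sort"
)

// queuedJob holds the two keys the tier ordering compares.
type queuedJob struct {
	fairnessKey string
	cost        float64 // accumulated cost charged to the fairness key
	arrival     int64   // monotonically increasing submission counter
}

// orderTier sorts jobs within one tier: lowest cost first, then by
// arrival time for jobs with equal cost.
func orderTier(jobs []queuedJob) {
	sort.Slice(jobs, func(i, j int) bool {
		if jobs[i].cost != jobs[j].cost {
			return jobs[i].cost < jobs[j].cost
		}
		return jobs[i].arrival < jobs[j].arrival
	})
}

func main() {
	jobs := []queuedJob{
		{fairnessKey: "tenant-b", cost: 5, arrival: 1},
		{fairnessKey: "tenant-a", cost: 2, arrival: 3},
		{fairnessKey: "tenant-c", cost: 2, arrival: 2},
	}
	orderTier(jobs)
	for _, j := range jobs {
		fmt.Println(j.fairnessKey) // tenant-c, tenant-a, tenant-b
	}
}
```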
Includes design doc (README.md), flow diagram (scheduler.svg), unit tests, and a soak test.